EXHIBIT A



TSG101 may be the prototype of a class of 
dominant negative ubiquitin regulators 



The TSG101 gene has been implicated in sporadic breast cancer (ref. 1).
Knockout of the murine orthologue resulted in fibroblast transformation
and the ability to form tumours (ref. 2). The TSG101 protein contains a
proline-rich region and a predicted coiled-coil domain, but no functional
clues have been extracted from the sequence (refs 1,2). We show that the
N-terminal domain of TSG101 belongs to a group of apparently inactive
homologues of ubiquitin-conjugating enzymes (E2), which may be dominant
negative regulators of E2 activity in cell-cycle control.

In database searches, non-globular protein domains produce numerous hits
to proteins with biased composition, obscuring important but subtle
signals (refs 3,4). When the non-redundant protein database at the NCBI
was screened with the sequence of the TSG101 protein using the BLAST2
program (ref. 5), similarity was detected only to proline-rich and
coiled-coil proteins. By contrast, the predicted N-terminal, globular
(ref. 6) domain of TSG101 (134 amino acids) showed significant similarity
(P< 10^) to the hypothetical yeast protein YCL008c, a homologue of E2
ubiquitin-conjugating enzymes (ref. 7). This search also detected
similarity to several E2 sequences, but it was not statistically
significant. YCL008c, however, is significantly similar to E2 (P< 10^^),
establishing a link between TSG101 and ubiquitin-conjugating enzymes.
This connection was supported by a multiple alignment constructed using
the MACAW program (ref. 8). The alignment includes four blocks conserved
in TSG101, YCL008c and E2, with P< 10"'^ for three of them (Fig. 1). The
active cysteine, which in E2 is conjugated with ubiquitin (refs 9,10), is
replaced by tyrosine in TSG101 and by alanine in YCL008c, though the
surrounding motif is conserved (Fig. 1). Database searches with E2
sequences identified another yeast protein and a nematode protein similar
to E2 but containing tyrosine or glutamine instead of the active cysteine
(Fig. 1). The putative inactive E2 homologues contain an additional
conserved motif, not shared by active ubiquitin-conjugating enzymes,
which may be implicated in a common function (Fig. 1).
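
The check for a replaced active cysteine can be illustrated with a minimal Python sketch. It assumes a gapped multiple alignment is already in hand and simply reads off the residue that each sequence places in the column of a reference enzyme's catalytic cysteine. The sequence names and the aligned fragments below are invented placeholders, not the sequences of Fig. 1, and the function names are hypothetical; the alignment itself would come from a separate program.

```python
# Toy alignment fragments around the catalytic position.
# These strings are invented placeholders for illustration only;
# they are NOT the sequences shown in Fig. 1.
TOY_ALIGNMENT = {
    "active_E2_a": "HPNI-GSICLDIL",
    "active_E2_b": "HPNVDGRICLDIL",
    "homologue_a": "HPHVKGKIYLPYL",
    "homologue_b": "HPNI-GEIALSLL",
}


def catalytic_column(reference_row, residue="C"):
    """Return the alignment column holding the reference's only `residue`."""
    columns = [i for i, aa in enumerate(reference_row) if aa == residue]
    if len(columns) != 1:
        raise ValueError("expected exactly one catalytic residue in the reference row")
    return columns[0]


def residues_at_catalytic_site(alignment, reference):
    """Map each sequence name to the residue aligned with the reference cysteine."""
    lengths = {len(row) for row in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all aligned rows must have the same length")
    column = catalytic_column(alignment[reference])
    return {name: row[column] for name, row in alignment.items()}


def substituted_homologues(alignment, reference):
    """List sequences whose catalytic position is not cysteine (gaps included)."""
    site = residues_at_catalytic_site(alignment, reference)
    return [name for name, aa in site.items() if aa != "C"]


if __name__ == "__main__":
    print(residues_at_catalytic_site(TOY_ALIGNMENT, "active_E2_a"))
    print(substituted_homologues(TOY_ALIGNMENT, "active_E2_a"))
```

In this toy input the two placeholder homologues carry tyrosine and alanine at the catalytic column while the flanking residues are largely retained, which is the pattern the text describes for TSG101 and YCL008c. The sketch says nothing about statistical significance, which has to be assessed separately.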

The three-dimensional structures of E2 enzymes from Arabidopsis and yeast
are available (refs 11,12). The alignment of TSG101 with E2 includes all
the structural elements of the latter, except for the two C-terminal
α-helices, which are replaced by the proline-rich domain in TSG101
(Fig. 1). A statistical test (ref. 13) indicated high likelihood of
structural similarity between TSG101 and the ubiquitin-conjugating
enzymes (P< 10-^). A 3-D model for TSG101, based on the alignment with
E2, was constructed and energy-minimized using the ICM method (Fig. 2;
ref. 14).

The ubiquitin system is implicated in several stages of cell-cycle
control (ref. 15). The ubiquitin-dependent degradation of the
cyclin-dependent kinase inhibitor SIC1 is critical for the G1-S
transition in yeast (ref. 16). Ubiquitin-mediated destruction of the
anaphase inhibitor PDS1 and of cyclin B is required for the
metaphase-anaphase progression and for the telophase-G1 progression,
respectively (refs 17-19). The checkpoint proteins p53 and NF-κB are also
degraded by the ubiquitin cascade (refs 20,21). Cyclin-specific E2
enzymes have been isolated (refs 22-24); their derivatives with serine
substituted for the active cysteine are dominant negative inhibitors of
cyclin destruction and block cells in metaphase (ref. 24). TSG101-like
proteins may belong to a novel class of cell-cycle regulators,
intercepting the ubiquitin cascade by forming non-productive complexes
with ubiquitin, target proteins or other components of the ubiquitin
machinery.

Note added in proof: We have recently become aware of similar conclusions
independently reached by others (Ponting, C.P., Cai, Y.-D. & Bork, P.
J. Mol. Med., in the press).

Fig. 1 Multiple alignment of TSG101 and other E2 derivatives with
replaced active cysteine and bona fide ubiquitin-conjugating enzymes. The
upper block of seven sequences includes ubiquitin-conjugating enzyme
homologues with replaced active cysteine, and the lower block of six
sequences consists of diverse, active ubiquitin-conjugating enzymes,
including two for which the 3-D structure has been determined,
UBC4_YEAST (pdb 2UCE) and UBC1_ARATH (pdb 1AAK), and a cyclin-specific
enzyme from the clam Spisula solidissima (C5-E2C). The sequences are
identified by their SWISS-PROT names (underlines) if available, and the
gene identification numbers in the NR database are indicated in the
far-right column. The sequence of YCL008c is different from that in the
database, as a fragment of amino-acid sequence with similarity to TSG101
was detected in a different frame upstream of the annotated coding
region. With the assumption that this portion of the gene was separated
from the downstream portion because of a frameshift error, the putative
complete protein (203 amino acids) was reconstructed from two overlapping
database sequences (GenBank accessions S61879 and X59720). Two partial
EST-encoded sequences with strong similarity to TSG101 from mouse
(GenBank AA199318) and Arabidopsis (GenBank T76298) are included. The
additional motif shared by TSG101 and other proteins with replaced active
cysteine (but not the active E2 enzymes) is overlined. The distances
between the aligned blocks and the distances from the protein termini are
indicated by numbers. The consensus includes amino acid residues
conserved in most of the aligned sequences; the residues that conform
with the consensus are shown by reverse typing. U, bulky hydrophobic
residue (I, L, V, M, F, Y, W); O, aromatic residue; S, serine or
threonine; -, negatively charged residue; +, positively charged residue;
dots indicate any residue. The ubiquitin-conjugating cysteine is
indicated by an asterisk. The alignment downstream of the active cysteine
was modified manually to align the tryptophan residue, which is conserved
in all available sequences of E2 enzymes. The experimentally determined
secondary structure elements for UBC4_YEAST and UBC1_ARATH are shown
below the alignment: h, α-helix; e, extended conformation (β-sheet);
l, loop.

Yeast Ubiquitin Conjugating Enzyme (ubc4_yeast, 2uce)

Fig. 2 Comparison of the 3-D structure of a yeast ubiquitin-conjugating
enzyme and a structural model for the globular domain of human TSG101.
a, 3-D structure of yeast UBC4 (ref. 12). The residues that are conserved
in most of the E2 enzymes are indicated. The active-site loop starts with
G83; the active-site cysteine (C86) is always flanked by two hydrophobic
residues (I85 and L87), which anchor C86 to the core and lock the
orientation of its side chain. The loop between C86 and W94, a residue
conserved in all known E2 sequences, may vary in length in different
groups of enzymes (Fig. 1). b, Hypothetical model of the TSG101 structure
based on the alignment in Fig. 1 and the UBC4 structure. The model was
free of steric clashes (van der Waals energy -608 kcal/mol) and contained
only one medium-size (63 Å³) cavity. All the essential residues spatially
adjacent to the active cysteine in E2 are conserved in TSG101. The
conserved tryptophan and the adjacent residues form a hydrophobic cluster
supporting the active-site geometry. Thus, the active-site loop in TSG101
is likely to have the same conformation as in E2 despite the replacement
of the active cysteine. The two regions that could not be reliably
aligned (Fig. 1) and the two C-terminal helices, which are apparently
missing in TSG101, do not contact the active site and are unlikely to
affect ligand binding. Despite the low sequence similarity in the
N-terminal helix, a conserved hydrophobic contact (I7-P62 and V7-P60 in
UBC4 and TSG101, respectively) could be proposed. A colour gradient from
blue (N terminus) to red (C terminus) is used in both images. The images
were generated with ICM software (ref. 25).



1. Li, L., Li, X., Francke, U. & Cohen, S.N. Cell 88, 143-154 (1997).

2. Li, L. & Cohen, S.N. Cell 85, 319-329 (1996).

3. Altschul, S.F., Boguski, M.S., Gish, W. & Wootton, J.C. Nature Genet. 6, 119-129 (1994).

4. Koonin, E.V., Altschul, S.F. & Bork, P. Nature Genet. 13, 266-268 (1996).

5. Altschul, S.F. & Gish, W. Meth. Enzymol. 266, 460-480 (1996).

6. Wootton, J.C. & Federhen, S. Meth. Enzymol. 266, 554-571 (1996).

7. Koonin, E.V., Bork, P. & Sander, C. EMBO J. 13, 493-503 (1994).

8. Schuler, G.D., Altschul, S.F. & Lipman, D.J. Proteins Struct. Funct. Genet. 9, 180-190 (1991).

9. Sung, P., Prakash, S. & Prakash, L. J. Mol. Biol. 221, 745-749 (1991).

10. Sullivan, M.L. & Vierstra, R.D. J. Biol. Chem. 266, 23878-23885 (1991).

11. Cook, W.J., Jeffrey, L.C., Sullivan, M.L. & Vierstra, R.D. J. Biol. Chem. 267, 15116-15121 (1992).

12. Cook, W.J., Jeffrey, L.C., Xu, Y. & Chau, V. Biochemistry 32, 13809-13817 (1993).

13. Abagyan, R.A. & Batalov, S.N. J. Mol. Biol. (in press).

14. Abagyan, R.A., Totrov, M.M. & Kuznetsov, D.A. J. Comp. Chem. 15, 488-506 (1994).

15. King, R.W., Deshaies, R.J., Peters, J.M. & Kirschner, M.W. Science 274, 1652-1659 (1996).

16. Schwob, E., Bohm, T., Mendenhall, M.D. & Nasmyth, K. Cell 79, 233-244 (1994).

17. Cohen-Fix, O., Peters, J.M., Kirschner, M.W. & Koshland, D. Genes Dev. 10, 3081-3093 (1996).

18. Glotzer, M., Murray, A.W. & Kirschner, M.W. Nature 349, 132-138 (1991).

19. Holloway, S.L., Glotzer, M., King, R.W. & Murray, A.W. Cell 73, 1393-1402 (1993).

20. Scheffner, M., Huibregtse, J.M., Vierstra, R.D. & Howley, P.M. Cell 75, 495-505 (1993).

21. Palombella, V.J., Rando, O.J., Goldberg, A.L. & Maniatis, T. Cell 78, 773-785 (1994).

22. Yu, H., King, R.W., Peters, J.M. & Kirschner, M.W. Curr. Biol. 6, 455-466 (1996).

23. Aristarkhov, A. et al. Proc. Natl. Acad. Sci. USA 93, 4294-4299 (1996).

24. Townsley, F.M.A. et al. Proc. Natl. Acad. Sci. USA 94, 2362-2367 (1997).

25. Molsoft, LLC. ICM Software Manual, version 2.6 (http://www.molsoft.com) (1996).



nature genetics volume 16 august 1997 



331 



